智能论文笔记

Effects of Annotations' Density on Named Entity Recognition Models' Performance in the Context of African Languages

Manuel A. Fokam

分类：自然语言处理 | 人工智能

2022-08-09

非洲语言最近是几项自然语言处理（NLP）研究的主题，这导致其在该领域的代表性大大增加。但是，在评估模型在诸如命名实体识别（NER）等任务中的性能时，大多数研究往往比数据集的质量更多地关注模型。尽管这在大多数情况下效果很好，但它并不能说明使用低资源语言进行NLP的局限性，即我们可以使用的数据集的质量和数量。本文根据数据集质量提供了各种模型的性能的分析。我们根据某些非洲NER数据集的每个句子的实体密度评估了不同的预训练模型。我们希望这项研究能够改善在低资源语言的背景下进行NLP研究的方式。

translated by 谷歌翻译

Assessment of creditworthiness models privacy-preserving training with synthetic data

Ricardo Muñoz-Cancino , Cristián Bravo , Sebastián A. Ríos , Manuel Graña

分类：机器学习

2022-12-31

Credit scoring models are the primary instrument used by financial institutions to manage credit risk. The scarcity of research on behavioral scoring is due to the difficult data access. Financial institutions have to maintain the privacy and security of borrowers' information refrain them from collaborating in research initiatives. In this work, we present a methodology that allows us to evaluate the performance of models trained with synthetic data when they are applied to real-world data. Our results show that synthetic data quality is increasingly poor when the number of attributes increases. However, creditworthiness assessment models trained with synthetic data show a reduction of 3\% of AUC and 6\% of KS when compared with models trained with real data. These results have a significant impact since they encourage credit risk investigation from synthetic data, making it possible to maintain borrowers' privacy and to address problems that until now have been hampered by the availability of information.

translated by 谷歌翻译

Smart Face Shield: A Sensor-Based Wearable Face Shield Utilizing Computer Vision Algorithms

Manuel Luis C. Delos Santos , Ronaldo S. Tinio , Darwin B. Diaz , Karlene Emily I. Tolosa

分类：计算机视觉

2022-12-18

The study aims the development of a wearable device to combat the onslaught of covid-19. Likewise, to enhance the regular face shield available in the market. Furthermore, to raise awareness of the health and safety protocols initiated by the government and its affiliates in the enforcement of social distancing with the integration of computer vision algorithms. The wearable device was composed of various hardware and software components such as a transparent polycarbonate face shield, microprocessor, sensors, camera, thin-film transistor on-screen display, jumper wires, power bank, and python programming language. The algorithm incorporated in the study was object detection under computer vision machine learning. The front camera with OpenCV technology determines the distance of a person in front of the user. Utilizing TensorFlow, the target object identifies and detects the image or live feed to get its bounding boxes. The focal length lens requires the determination of the distance from the camera to the target object. To get the focal length, multiply the pixel width by the known distance and divide it by the known width (Rosebrock, 2020). The deployment of unit testing ensures that the parameters are valid in terms of design and specifications.

translated by 谷歌翻译

A perspective on the use of health digital twins in computational pathology

Manuel Cossio

分类：人工智能

2022-11-30

A digital health twin can be defined as a virtual model of a physical person, in this specific case, a patient. This virtual model is constituted by multidimensional data that can host from clinical, molecular and therapeutic parameters to sensor data and living conditions. Given that in computational pathology, it is very important to have the information from image donors to create computational models, the integration of digital twins in this field could be crucial. However, since these virtual entities collect sensitive data from physical people, privacy safeguards must also be considered and implemented. With these data safeguards in place, health digital twins could integrate digital clinical trials and be necessary participants in the generation of real-world evidence, which could positively change both fields.

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

Reducing Down(stream)time: Pretraining Molecular GNNs using Heterogeneous AI Accelerators

Jenna A. Bilbrey , Kristina M. Herman , Henry Sprueill , Soritis S. Xantheas , Payel Das , Manuel Lopez Roldan , Mike Kraus , Hatem Helal , Sutanay Choudhury

分类：机器学习

2022-11-08

The demonstrated success of transfer learning has popularized approaches that involve pretraining models from massive data sources and subsequent finetuning towards a specific task. While such approaches have become the norm in fields such as natural language processing, implementation and evaluation of transfer learning approaches for chemistry are in the early stages. In this work, we demonstrate finetuning for downstream tasks on a graph neural network (GNN) trained over a molecular database containing 2.7 million water clusters. The use of Graphcore IPUs as an AI accelerator for training molecular GNNs reduces training time from a reported 2.7 days on 0.5M clusters to 1.2 hours on 2.7M clusters. Finetuning the pretrained model for downstream tasks of molecular dynamics and transfer to a different potential energy surface took only 8.3 hours and 28 minutes, respectively, on a single GPU.

translated by 谷歌翻译

Efficient Methods for Natural Language Processing: A Survey

Marcos Treviso , Tianchu Ji , Ji-Ung Lee , Betty van Aken , Qingqing Cao , Manuel R. Ciosici , Michael Hassid , Kenneth Heafield , Sara Hooker , Pedro H. Martins

分类：自然语言处理

2022-08-31

从有限的资源中获得最大收益可以进步自然语言处理（NLP）研究和实践，同时保守资源。这些资源可能是数据，时间，存储或能源。NLP的最新工作从缩放率产生了有趣的结果。但是，仅使用比例来改善结果意味着资源消耗也会扩展。这种关系激发了对有效方法的研究，这些方法需要更少的资源才能获得相似的结果。这项调查涉及NLP效率的方法和发现，旨在指导该领域的新研究人员并激发新方法的发展。

translated by 谷歌翻译

A Spanish dataset for Targeted Sentiment Analysis of political headlines

Tomás Alves Salgueiro , Emilio Recart Zapata , Damián Furman , Juan Manuel Pérez , Pablo Nicolás Fernández Larrosa

分类：自然语言处理

2022-08-30

几项作品已经研究了主观文本，因为它们可以在用户中引起某些行为。大多数工作都集中在社交网络中的用户生成的文本上，但是其他一些文本也包括对某些主题的观点，可能会影响政治决策期间的判断标准。在这项工作中，我们解决了针对新闻头条领域的有针对性情绪分析的任务，该领域由主要渠道在2019年阿根廷总统大选期间发布。为此，我们介绍了1,976个头条新闻的极性数据集，该数据集在2019年选举中以目标级别提及候选人。基于预训练的语言模型的最先进的分类算法的初步实验表明，目标信息有助于此任务。我们公开提供数据和预培训模型。

translated by 谷歌翻译

Training a T5 Using Lab-sized Resources

Manuel R. Ciosici , Leon Derczynski

分类：自然语言处理

2022-08-25

在大型数据集上培训大型神经语言模型是资源和时间密集型的。这些要求造成了进入的障碍，其中资源较少的人无法建立竞争模型。本文介绍了各种技术，以使（a）使用适中的研究实验室可能拥有的资源训练大型语言模型，以及（b）在合理的时间内训练它。我们为从业人员提供具体的建议，我们通过案例研究来说明这一点：丹麦的T5模型，第一种语言。

translated by 谷歌翻译

HTML版本

Advantages in Using a Stock Spring Selection Tool that Manages the Uncertainty of the Designer Requirements

Manuel Paredes , Marc Sartor , Cédric Masclet

分类：人工智能

2022-08-04

本文分析了使用管理设计师要求不确定性的股票弹簧选择工具的优势。首先，描述了手动搜索及其主要缺点。然后，提出了计算机辅助弹簧选择工具，该工具执行所有必要的计算，以从数据库中提取最合适的弹簧。该算法使用多标准分析和模糊逻辑分析了具有间隔值的数据集。列出了两个示例，分别进行了手册和辅助搜索。他们不仅显示了使用辅助搜索的结果明显更好，而且还可以帮助设计师轻松，精确地详细说明其规格，从而提高设计过程的灵活性。

translated by 谷歌翻译